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TECHNICAL FIELD 

This invention relates generally to polypeptides modified by the attachment of compounds having 
sulfhydryl reactive groups, improved methods for producing such modified polypeptides and improved 
5 compositions containing them. The invention relates particularly to three modified polypeptides (IL-3, G-CSF 
and EPO), to which sulfhydryl reactive compounds, including polymers, may be attached at selected 
positions in the polypeptide that have been modified by the insertion of cysteine residues or the substitution 
of cysteine residues for other residues. 

10 BACKGROUND 

The desirability of modifying biologically active and therapeutically useful polypeptides with a variety of 
compounds, such as the hydrophllic polymer polyethylene glycol (PEG), to enhance their pharmacokinetic 
properties has been noted. See, e.g., the discussion of the art in this area of polypeptide modification in 

75 published PCT patent application WO87/00056, in U.S. Pat. No. 4,179,337. which discloses conjugating 
water soluble polypeptides such as enzymes and insulin to PEG or PPG, and in U.S. Pat. No. 4,766.106. 
which discloses conjugating ordinarily water insoluble beta-interferon, interleukin-2. or immunotoxins to PEG 
homopoiymers or polyoxyethylated glycerol. Such modification can reduce adverse immune response to 
the polypeptide, increase the solubility for use in pharmaceutical preparations and maintain a desirable 

20 circulatory level of such polypeptide for therapeutic efficacy. 

One problem not addressed by the art in this area involves the extent to which a polypeptide can be 
modified by attachment of compounds having reactive groups that will covalently bond to certain amino 
acid residues of the polypeptide. For example, modification of a polypeptide with PEG or similar polymers, 
can result in random attachment of the polymer at the amino terminus of the polypeptide and/or at one or 

25 more lysine residues in the amino acid sequence of the protein. Because more than one PEG group can 
attach to the polypeptide, the resultant composition may contain a heterogeneous mixture of "PEGylated" 
polypeptide: some polypeptides having only one PEGylated site, others having more than one PEGylated 
site. Such heterogeneity In composition is undesirable for pharmaceutical use. Furthermore, the non- 
specificity with regard to the site(s) of attachment of compounds such as PEG to the polypeptide can result 

30 in loss of biological efficacy of the polypeptide stemming from undesirable attachment to a polypeptide site 
required for biological activity. United States Patent 4.904,584 addresses the foregoing by providing 
materials and methods for site specific covalent modification of polypeptides by lysine insertion, removal, 
and/or replacement. However, we have detemnined that the use of lysine as the attachment site for 
modification, for example, by PEGylation, may be disadvantageous because not all modifications may result 

35 in biologically active compounds and because steps must be taken to prevent PEGylation at N-termIni in 
cases where N-terminat PEGylation is not desired. 

SUMMARY OF THE INVENTION 

40 This invention provides materials and methods for site specific covalent modification of polypeptides, 
particularly and preferably human IL-3, granulocyte colony stimulating factor (G-CSF) and erythropoietin 
(EPO) polypeptides, permitting the production of compositions comprising homogeneously cys modified IL- 
3s, G-CSFs and EPOs and pharmaceutical compositions containing the same. "Homogeneously cys 
modified" as the term is used herein means substantially consistently modified only at specific, inserted or 

45 substituted cysteine residues. A homogeneously modified IL-3 for example, includes an IL-3 composition 
which is substantially consistently modified at position 6 (using the convention of counting from the N- 
terminus of the mature protein) by the insertion of cysteine in place of the threonine of natural IL-3, but not 
at other positions. 

Thus, this invention first provides cysteine added variants ("CAVs") of IL-3. G-CSF and EPO. CAVs of 
50 this invention encompass IL-3. G-CSF and EPO muteins that contain at least one additional cysteine residue 
compared to the corresponding naturally occurring or previously known IL-3, G-CSF and EPO. The cysteine 
residu6(s) are introduced into the peptide structure of the CAVs at one or more amino acid positions in the 
natural or previously known counterpart. 

In the case of human IL-3, we have determined that the naturally occurring cysteine residues at 
55 positions 16 and 84 form a disulfide bridge, essential to preserving the desired biological activity of the 
polypeptide. For the addition of novel cysteines, some positions within the polypeptide, such as position 15 
and 51 are unsuitable; cysteines introduced at these positions give rise to human IL-3 polypeptides with 
substantially reduced biological activity. However, certain substitutions or deletions of residues 1-14 do not 
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significantly diminish the desired biological activity of IL-3. Therefore, a preferred region of novel cysteine 
introduction into the polypeptide is within positions 1-14 inclusive. Currently, positions 6-12 inclusive are 
especially preferred sites for cysteine introduction. The subsequent attachment of sulfhydryl reactive 
compounds, including polymers, as discussed below, to the novel cysteines added at selected positions 

5 within this region will not result in any significant toss of biological activity. 

By "cysteine added variant" as the term is used herein, we mean variants of IL-3, G-CSF and EPO that 
are modified In amino acid staicture relative to naturally occurring or previously known counterparts such 
that at least one cysteine residue is inserted into the natural or previously known sequence and/or is used 
to replace a different amino acid within that sequence. 

10 Additionally, with respect to IL-3. the native or "natural" tL-3 sequence, with an added initiator 
methionine for bacterial expression, may be further modified such that the first alanine is deleted at the N- 
terminus of the mature polypeptide, altering the amino terminal sequence from METALATRO to METTRO 
(the "mp" mutein). For the *'mp'' mutein, such N-terminus modification permits more consistent removal of 
the N-terminal methionine. As is already known, in bacterial expression systems, cleavage at the N-terminal 

75 methionine occurs. Likewise, the native EPO N-terminal sequence (with the added MET) begins 
METALATRO and it may prove advantageous to delete the first alanine to obtain an mpEPO mutein. With 
regard to G-CSF, the natural human N-terminal sequence begins with METTHR'PRO (with the MET added 
for bacterial production) and it may be desirable to delete this N-terminal threonine to advantageously 
obtain a mpG-CSF mutein. 

20 Alternatively, the natural IL-3 sequence may be further modified such that the first two amino acids at 
the N-terminus of the mature polypeptide are deleted, leaving a terminus beginning with 
METTHR*GLirTHR* (the "m3" mutein). For the "m3" mutein, such N-terminus modification permits one to 
take advantage of the methionine at position 3 in the naturally occurring human IL-3 molecule, as the 
initiator methionine. 

25 The CAVs of this invention make it possible to produce homogeneous, biologically active IL-3, G-CSF 
and EPO compositions substantially specifically and consistently modified at selected positions with 
sulfhydryl reactive compounds (described hereinafter). 

In the practice of this invention, at least one cysteine residue is introduced in that portion of the IL-3, G- 
CSF or EPO polypeptide where modification via a sulfhydryl reactive compound is desired. The cysteine 

30 residue or residues are so Introduced by genetic engineering methods as described below. Novel cysteine 
residues may be engineered into the polypeptide for example, by simple insertion of a cysteine codon into 
the DNA molecule at the desired site or by converting a desirably located asparagine or other codon to a 
cysteine codon. Convenient methods for site specific mutagenesis or DNA synthesis for producing a DNA 
molecule encoding the desired CAV, expression in procaryotic or eucaryotic host cells of the DNA molecule 

35 so produced, and recovery of the CAV produced by such expression are also disclosed. 

The CAVs of this invention retain useful biological properties of the natural or previously known protein 
and may thus be used for applications identified for the non-modified parent. Modification with such 
sulfhydryl reactive compounds, however, is preferred. Such biologically active, modified CAVs can be 
produced in homogeneous compositions which, it is contemplated, will provide improved pharmacokinetic 

40 profiles, immunogenicity profiles, and/or solubility characteristics relative to the parent polypeptides. 
Furthermore, CAVs may enable the formation of multlmeric forms of the normally monomeric polypeptide 
with the same, albeit improved characteristics. Multimeric CAVs also enable the formation of "hetero- 
conjugates"- i.e., two or more distinct polypeptides joined via the sulfhydryl groups of the added cysteine 
residues, e.g.. IL-3 joined to EPO or IL-3 joined to G-CSF. 

45 Biological activity of the CAVs before or after modification with the sulfhydryl reactive compounds may 
be determined by standard in vitro or in vivo assays conventional for measuring activity of the parent 
polypeptide. Alternatively, we provide herein a "small scale" screening method wherein successful Cys 
modification and attachment of the sulfhydryl reactive compound may be tested. 

Selective and homogeneous modification of the CAVs with sulfhydryl reactive compounds is possible 

50 since such compounds will covalently bond primarily only to the cysteine residue(s) in the CAV. Secondary 
reactivity at His, Lys and Tyr residue(s) may be observed, depending on the choice of sulfhydryl reactive 
compound, but at a significantly lower rate. The modified CAVs so produced may then be recovered, and if 
desired, further purified and formulated into pharmaceutical compositions by conventional methods. 

Sulfhydryl reactive compounds include compounds such as polyalkylene glycol, e.g. polyethylene and 

55 polypropylene glycol, as well as derivatives thereof, with or without coupling agents or derivatization with 
coupling or activating moieties, for example, with thiol, triftate, tresylate, aziridine or oxirane, or preferably 
with S-pyridyl or maleimide moieties. Compounds such as S-Pyridyl Monomethoxy PEG and Maleimido 
Monomethoxy PEG are exemplary. Additionally, sulfhydryl reactive compounds include, but are not limited 
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to, charged or neutral polymers of the following types: dextran. colominic acids or other carbohydrate based 
polymers, polymers of amino acids and biotin derivatives, resulting in a protein modified with this well 
known affinity reagent often used for antibody based assays. 

Briefly, the method comprises reacting the CAV with a sulfhydryl reactive compound under suitable 

5 conditions, preferably non-denaturing conditions, and in sufficient amounts permitting the covalent attach- 
ment of the sulfhydryl reactive compound to the introduced cysteine residue{s) present in the polypeptide 
backbone of the CAV. The reaction may be reducible or non-reducible; and generally, the amount of 
sulfhydryl reactive compound used should be at least equimolar to the number of cysteines to be 
derivatized. although use of excess sulfhydryl reactive compound is prefen'ed, both to improve the rate of 

10 reaction and to insure consistent modification at alt reactive sites. The modified CAV produced, may then 
be recovered, purified and formulated by conventional methods. See e.g., WO 87/00056 and references 
cited therein. 

Other aspects of the present invention include therapeutic methods of treatment and therapeutic 
compositions which employ the modified CAVs of the present invention, either alone or with other 
15 lymphokines, hematopoietins and/or growth factors, such as granulocyte macrophage colony-stimulating 
factor (GM-CSF) , macrophage colony-stimulating factor (M-CSF), IL-1, IL-2, IL-4, IL-5, IL-6 and IL-10. These 
methods and compositions take advantage of the improved pharmacokinetic properties of these modified 
CAVs to provide treatments, e.g., such as employing lower dosages ? of polypeptide, less frequent 
administration, lower immunogenicity and more desirable distribution, required for the therapeutic indica- 
20 tions for the natural polypeptide. 

Other aspects and advantages of the present invention will be apparent upon consideration of the 
following detailed description of the invention, including illustrative examples of the practice thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is the human IL-3 gene construct for E. coli expression, having the polypeptide sequence shown 
of natural (wild type) human IL-3. plus an initiator methionine, as expressed in E.coIi, with the amino acids 
numbered from the N-terminus for reference to the muteins discussed herein. 

Fig. 2 is the human G-CSF gene construct for E. coli expression, having the polypeptide sequence 
shown of natural (wild type) human G-CSF, plus an initiator methionine, as expressed In E. coli with the 
amino acids numbered from the N-temninus for reference to the muteins discussed herein. 

Rg. 3 is a chemically synthesized human EPO gene construct for E. coli expression, having the 
polypeptide sequence of natural (wild type) human EPO. plus an initiator methionine, as expressed in E. coli 
with the amino acids numbered from N-terminus for reference to the muteins discussed herein. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention involves the selective modification of lL-3. G-CSF and EPO for pharmaceutical 
use. to both enhance their pharmacokinetic properties and provide homogeneous compositions for human 
40 therapeutic use. Although human IL-3, DNA and peptide sequences are preferred as the starting point in 
tills invention as it relates to IL-3. any primate IL-3 is susceptible to use in tiie method of the invention, 
given the significant homology between e.g.. human and gibbon species of the protein and DNA. See Leary 
et al.. Blood (1982) 70: 1343-1348. The method for selectively modifying lL-3, G-CSF and EPO involves 
selecting locations in the polypeptide sequence for the attachment of sulfhydryl reactive compounds. This 
45 step may be accomplished by altering the amino acid sequence of the polypeptide by Inserting cysteine 
residues at selected sites or by converting selected endogenous residues into cysteine residues. For 
example, the codons AAA or AAG. which code for lysine, can be changed to the codon TGC or TGT, which 
code for cysteine. 

CAVs in accordance with tiiis invention also include allelic variations in the protein sequence, i.e. 

50 sequence variations due to natural variability from individual to individual, or with other amino acid 
substitutions or deletions which still retain desirable biological properties of the parent. 

All CAVs of this invention may be prepared by expressing recombinant DNA sequences encoding the 
desired variant in host cells. e.g. procaryotic host cells such as E. coH, or eucaryotic host cells such as 
yeast or mammalian host cells, using methods and materials, e.g. vectors, as are known in the art. Host 

55 cells containing and capable of expressing the CAV-encoding DNA are thus encompassed by this invention. 
DNA sequences encoding the variants may be produced synthetically or by conventional site-directed 
mutagenesis of DNA sequences encoding the protein or polypeptide or analogs tiiereof. Figure 1 shows the 
human IL-3 gene construct inserted in ptasmid pAL-hlL3-781 and expressed in the E. coli K1 2 strain 
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designated QI586. This strain containing the plasmid was deposited with the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville. Maryland 20852 USA on April 19, 1989 and given accession 
number 67932. Other DNA sequences for natural primate IL-3 have been cloned and the DNA sequences, 
including cDNA sequences, and specific peptide sequences for the same have been published, in PCT 

5 application number US87/01702, published as WO 88/00598 on January 28, 1988, and are therefore known 
in the art. These DNA sequences have been deposited with the American Type Culture Collection and 
given accession numbers ATCC 67154, 67326, 67319 and its replacement 68042. and 40246. DNA 
sequences for natural G-CSF and EPO have been cloned and the sequences and their corresponding 
peptide sequences published and are therefore known in the art. 

10 DNA molecules encoding natural human IL-3s, G-CSFs and EPOs therefore may be obtained (I) by 
cloning in accordance with the published methods, (ii) from the deposited plasmlds, or (iii) by synthesis, 
e.g. using overiapping synthetic oligonucleotides based on the published sequences which together span 
the desired coding region. Such methods are known in the art. See the foregoing PCT application published 
as WO 88/00598 and PCT application number US8&/00402 published as WO88/06161. 

15 As mentioned above. DNA sequences encoding individual CAVs of this Invention may be produced 
synthetically or by conventional site-directed mutagenesis of a DNA sequence encoding the parental 
polypeptides or analogs thereof. Such methods of mutagenesis include the Ml 3 system of Zoller and 
Smith. Nucleic Acids Res. (1982) 10:6487 - 6500; Methods Enzymol. (1983) 100:468-500; and DNA (1984) 
3:479-488, which uses single stranded DNA and the method of Morinaga et al., Bio/technology (July 1984) 

20 636-639, which uses heteroduplexed DNA. Exemplary oligonucleotides used in accordance with such 
methods are described below. It should be understood, of course, that DNA encoding each of the CAVs of 
this invention may be analogously produced by one skilled in the art through site-directed mutagenesis 
using appropriately chosen oligonucleotides. 

The new DNA sequences encoding the CAVs of this Invention can be introduced into appropriate 

25 vectors for heterologous expression in the desired host cells, whether procaryotic or eucaryotic. The activity 
produced by the transiently transfected or stably transformed host cells (or their progeny) may be 
measured by using standard assays conventional for the parental protein. Where the host cell Is bacterial, 
the DNA should be free of introns, e.g. a cDNA or synthetic DNA, and may be free of any secretory leader 
sequence. For eucaryotic expression, introns may be present or absent and a secretory leader sequence 

30 should preferably be present. 

The CAVs produced by expression In the genetically engineered host cells may then be purified, and If 
desired formulated into pharmaceutical compositions by conventional methods, often preferably by methods 
which are typically used in purifying and/or formulating the parental protein. It is contemplated that such 
pharmaceutical compositions containing the CAV In admixture with a pharmaceutically acceptable carrier 

35 will possess similar utilities to those of the parental proteins, such as those set forth in WO 88/00598 supra , 
at page 3. 

In another, and preferred, aspect of this invention, the CAVs produced by recombinant means as 
mentioned above are reacted with the desired sulfhydryl reactive compound under conditions permitting 
attachment of the sulfhydryl reactive moiety to the sulfhydryl group of the introduced cysteine residues In 
40 the peptide backbone of the CAV. These modified CAVs, preferably produced initially on a small scale, may 
then be screened for bioactive muteins possessing the sulfhydryl reactive compounds attached to the site 
or sites desired. Alternatively, this screening may be accomplished before attachment with the sulfhydryl 
reactive compound. 

The term "sulfhydryl reactive compound" Is defined herein as any compound having, or capable of 
45 being activated to have, a reactive group capable of forming a covalent attachment to the sulfhydryl group 
(-SH) of the cysteine residue. Included among such compounds are polymers such as PEG and poly- 
propylene glycol (PPG), dextran. colominic acids or other cariwhydrate based polymers and polymers of 
amino acids and biotin derivatives. Activation may occur by modification of the compound with a sulfhydryl 
moiety, such as a sulfhydryl group, thiol, triflate, tresylate, aziridine or oxirane, or preferably, with S-pyridyl 
50 or maleimide. The sulfhydryl reactive compound need not have any particular molecular weight, but a 
molecular weight of between about 1,000 and 30,000 for the activated compound is preferred, especially for 
PEG. Methods of attachment will be described in detail below. By controlling the number and location of the 
cysteines In the CAV sequence, the number and location(s) of the attached sulfhydryl reactive compound 
can be selectively controlled. Such control of attachment location and number enables the production of 
55 only certain selectively modified molecules retaining the desired biological activity, rather than production of 
a heterogeneous mixture of variably modified molecules, only some of which may be active. It is also 
important to note that this positional selectivity of the PEGylation or other attachment allows the normal 
functional Interactions of the protein to be preserved, blocked, or regenerated by release of the sulfhydryl 
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reactive compound. 

Another aspect of the invention is therefore homogeneous compositions of modified CAVs as described 
herein, e.g. PEGylated CAVs. Specific embodiments of IL-3 CAVs of the invention include human IL-3 which 
has a cysteine residue replacing lysine at position 10 and the m3 initiation sequence. (Amino acid numbers 

5 for the CAVs of the present invention are used herein in the conventional manner, sequentially from the N- 
terminus, and correlate with the numbering system used in Rg. 1 for the natural human IL-3 as expressed 
in E. coli.) Similarly, the naturally occurring lysine residue in human IL-3 (Fig, 1) at amino acid position 100 
may be converted to a cysteine to create a human IL-3 CAV of the invention. Another embodiment has 
cysteine at positions 9 and 10 and the m3 initiation sequence. 

10 Specific embodiments of G-CSF and EPO CAVs of the invention include human G-CSF which has an 
alanine residue replacing the naturally occurring cysteine residue at position 17 and a cysteine residue 
replacing the naturally occurring alanine at position 37, and human EPO which has a cysteine residue 
replacing serine at position 9. The modification of Ala17 of G-CSF is made to prevent possible improper 
disutflde bridge formation. 

75 For bacterial expression where the secretory leader-encoding DNA sequence is removed from the CAV- 
encoding DNA. it may be desirable to additionally modify the sequence such that it encodes an N-terminus 
comprising Met-Pro— (the mp mutein) instead of other N-termini such as Met-Ala-Pro (in IL-3 and EPO) or 
Met-Thr-Pro (in G-CSF). Such N-terminal modification permits more consistent removal of the N-terminal 
methionine. Altematively, the first two residues of natural, human IL-3 may be deleted, leaving the naturally 

20 occurring methionine at position 3 as the translation initiator (the m3 mutein). 

CAVs of this invention, modified as described, encompass CAVs containing other modifications as well, 
including truncation of the peptide sequence, deletion or replacement of additional amino acids with amino 
acids other than cysteine, insertion of new N-llnked glycosylation sites, abolishment of natural N-linked 
glycosylation sites, etc., so long as the bioactivlty of the molecule is retained. Thus, this invention 

25 encompasses CAVs encoded for by DNA molecules which are capable of hybridizing under stringent 
conditions to the DNA molecule encoding the parental IL-3, G-CSF or EPO (or would be so capable but for 
the use of synonymous codons) so long as the encoded polypeptide contains one or more additional 
Introduced cysteine residues relative to the parental peptide sequence. Exemplary stringent conditions can 
be found In T. Maniatis, et al.. Molecular Cloning (A Laboratory Manual) , Cold Spring Harbor Laboratory 

30 (1982). pages 387-389. An example of one such stringent hybridisation condition is 4XSSC at 65°C, 
followed by a washing in 0.1XSSC at 65**C for an hour. Alternatively, an exemplary stringent hybridization 
condition is 50% formamide. 4XSSC at 42*^0. 

Because the method and compositions of this invention provide homogeneous modified IL-3s, G-CSFs 
and EPOs. the invention also encompasses such homogeneous compositions for pharmaceutical use which 

35 comprise a therapeutically effective amount of a modified CAV described above in admixture with a 
pharmaceutically acceptable canrier. Such composition can be used in generally the same manner as that 
described for the natural or recombinant polypeptides. It is contemplated that the compositions will be used 
for treating a variety of conditions, e.g. involving stimulating hematopoiesis or improving a patient's 
hematological profile. For example, a modified IL-3 of the present invention may be used as an adjunct to 

40 cancer chemotherapy, radiation therapy, or in the treatment of immune disorders, as discussed in WO 
88/00598, at page 17-19. The exact dosage and method of administration will be determined by the 
attending physician depending on the particular modified CAV employed, the potency and pharmacokinetic 
profile of the particular compound as well as on various factors which modify the actions of drugs, for 
example, body weight, sex, diet, time of administration, drug combination, reaction sensitivities and severity 

45 of the particular case. Generally, the dally regimen should be in the range of the dosage for the natural or 
recombinant unmodified protein, e.g. a range of about 0.1 to about 100 ug of polypeptide per kilogram of 
body weight, preferably from about 0.1 to about 30 ug of polypeptide per kilogram of body weight. 

The therapeutic method and compositions of the present invention may also include co-administration 
with other drugs or human factors. A non-exclusive list of other appropriate hematopoietins, CSFs (colony 

50 stimulating factors) and interleukins for simultaneous or serial co-administration with the CAVs of the 
present invention Includes GM-CSF. CSF-1 (in its various known forms: CSF-1 is also referred to as M-CSF 
or macrophage colony-stimulating factor), Meg-CSF, IL-1, IL-2, IL-4, IL-6, IL-10, B-cell growth factor, B-cell 
differentiation factor and eosinophil differentiation factor. Additionally, the CAVs of the present invention may 
be administered with, or chemically attached to, monoclonal or polyclonal antibodies in a therapeutic use. 

55 Altematively, these growth factors may be attached to certain toxins, e.g., ricin, for use In a therapeutic 
regimen. The dosage recited above would be adjusted to compensate for such additional components in 
the therapeutic composition or regimen. In the case of pharmaceutical compositions containing modified 
lymphokine CAVs, for exampi . progress of the treated patient can be monitored by periodic assessment of 
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the hematological profile, e.g. white cell count, hematocrit and the like. 

The following examples illustratively describe the CAVs and the methods and compositions of the 
present invention. 

5 EXPERIMENTAL MATERIALS, METHODS AND EXAMPLES 

EXAMPLE 1: Eucaryotic Expression Materials and Methods 

Eukaryotic cell expression vectors into which DNA sequences encoding CAVs of this invention may be 
10 inserted (with or without synthetic linkers, as required or desired) may be synthesized by techniques well 
known to those skilled in this art. The components of the vectors such as the bacterial replicons. selection 
genes, enhancers, promoters, and the like may be obtained from natural sources or synthesized by known 
procedures. See Kaufman et aJ.. J. Mol. Biol. , (1982) 159: 601-621, Kaufman, Proc. Natl. Acad. Sci. (1985) 
82:689-693. See also WO 87/04187, filed January 2, 1987 (pMT2 and pMT2-ADA), and US Patent 
75 Application Serial No. 88,188, filed August 21, 1987 (pxMT2). Exemplary vectors useful for mammalian 
expression are also disclosed in the patent applications cited in Example 4. which are hereby incorporated 
by reference. Eucaryotic expression vectors useful in producing variants of this invention may also contain 
inducible promoters or comprise inducible expression systems as are known in the art. See US Patent 
Application Serial No. 893.115 (filed August 1. 1986) and PCT/US87/01871. published as WO8a^00975 on 
20 February 11, 1988. 

Established cell lines, including transformed cell lines, are suitable as hosts. Normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, as well as primary explants (including relatively 
undifferentiated cells such as hematopoietic stem ceils) are also suitable. Candidate cells need not be 
genotypically deficient in the selection gene so long as the selection gene is dominantly acting. 

25 If eucaryotic host cells are used, they will preferably will established mammalian cell lines. For stable 
integration of the vector DNA into chromosomal DNA, and for subsequent amplification of the integrated 
vector DNA, both by conventional methods, OHO (Chinese Hamster Ovary) cells are presently preferred in 
such embodiments. Altematively. the vector DNA may include all or part of the bovine papilloma virus 
genome (Lusky et al.. Ce|l (1984) 36: 391-401) and be carried in cell lines such as C127 mouse cells as a 

30 stable eptsomal element. Other usable mammalian cell lines include HeLa, COS-1 monkey cells, melanoma 
cell lines such as Bowes cells, mouse L-929 cells, 3T3 lines derived from Swiss, Balb-c or NIH mice, 6HK 
or HaK hamster cell lines and the like. 

Stable transformants then are screened for expression of the CAV product by standard immunological 
or activity assays. The presence of the DNA encoding the CAV polypeptides may be detected by standard 

35 procedures such as Southern blotting. Transient expression of the CAV genes during the several days after 
introduction of the expression vector DNA into suitable host cells such as COS-1 monkey cells is measured 
without selection by activity or immunologic assay of the proteins in the culture medium. 

Following the expression of the DNA by conventional means, the CAVs so produced may be recovered, 
purified, and/or characterized with respect to physicochemical, biochemical and/or clinical parameters, all by 

40 known methods. 

EXAMPLE 2: Bacterial and Yeast expression 

Bacterial and yeast expression may be effected by inserting (with or without synthetic linkers, as 
45 required or desired) the DNA molecule encoding the desired CAV into a suitable vector (or inserting the 
parental DNA sequence into the vector and mutagenizing the sequence as desired therein), then transform- 
ing the host cells with the vector so produced using conventional vectors and methods as are known in the 
art. e.g. as disclosed in published PCT Application No. WO 86/00639. published January 30, 1986. 
Transformants are identified by conventional methods and may be subcloned if desired. Characterization of 
50 transformants and recombinant product so produced may be effected and the product recovered and 
purified, all as described in Example 1. 

For bacterial expression, the DNA sequences encoding the CAVs are preferably modified by conven- 
tional procedures to encode only the mature polypeptide and may optionally be modified to include 
preferred bacterial codons. 

55 
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Expression in E. coli : 

The IL-3 CAVs of Example 5 were expressed in E, coli as follows: Plasmid pAL-hlL3-781 was 
transformed into an E, coli K12 strain GI586, a derivative of strain W31 10 in wtiich the C| and Rex regions of 

5 bacteriophage lambda canning the Cf857 allele have been inserted into the Clal site of the lacZ gene of the 
bacterial genome. This insert consists of all of the DNA sequences between nucleotides 35711 and 38104 
of the phage genome. See F. Sanger, et al., J. Mol. Biol. (1982) 162:729. E. coli K12 strain GI586 {pAL-hlL3- 
781) was deposited at the ATCC on April 19, 1989 and given accession number 57932. 

When GI586 transfomied with pAL-hlL3-781 is grown at 30 degrees centigrade to high cell density and 

10 then heated to 40 degrees centigrade, IL-3 is produced rapidly and accumulates over the next two or three 
hours to reach greater than 10 percent of the total cellular protein. This protein is produced in an insoluble 
form which must be solubilized and refolded by conventional methods. See, e.g., T.E. Creighton, Prog. 
Biophys. Molec. Biol. (1978) 33:231-297. Following expression, the CAVs so produced were recovered, 
purified and characterized as follows. 

75 The G-CSF CAV of Example 5 and the EPO CAV of Example 6 were expressed similarly in E. coli by 
removing the DNA sequence encoding CAV IL-3 from pAL-hlL3-781 and inserting the appropriate G-CSF or 
EPO DNA sequence as set forth in those Examples. Transformations were canied out under the same 
conditions set forth above. 

20 1. Purification of CAV IL-3 

All buffers were prepared using glass distilled water; all were degassed for at least five minutes, using 
house vacuum/sonification. prior to the addition of DTT. 

First, 400 grams wet weight frozen E. coli cell paste was suspended in 2500 ml of buffer containing 50 

25 mM Tris-HCI, pH 8.5, 1 mM EDTA, 5 mM P-aminobenzamidine, 1 mM PMSF and 2 mM DTT (hereinafter in 
this Example "buffer A"), to obtain a final volume of 2850 ml. Glass rods and magnetic slirers were used to 
resuspend the cell paste. Then the cell suspension was lysed by passing it through a matin gaulin valve at 
9000 psi four times, with cooling between each time. Temperature was maintained below 30 degrees 
centigrade by collection of the lysate into glass vessels cooled in ice/water mixture. Protein concentration 

30 was 22 mg/ml; final volume was 2850 ml. 

The lysate was centrifuged for 30 minutes at 8000 rpm in a Sorval centrifuge with a GS-3 rotor. The 
supernatant (2600 ml at 17.0 mg/ml) was discarded and the resultant pellet (hereinafter in this Example PI) 
from this centrrfugation was resuspended in approximately 400 ml buffer A, using glass rods and a 
magnetic stinrer. The milky suspension was then passed through an 18 gauge needle using a 60 ml syringe. 

35 The final volume was 640 ml, with a protein concentration of 25.6 mg/ml. 

The resuspended PI pellet was then centrifuged for 10 minutes in a Sorval centrifuge with a GS-3 rotor 
at 8000 rpm. The supernatant from this centrifugation was poured into two fresh centrifuge tubes 
(hereinafter in this example "82") and the resultant pellet ("P2") was resuspended in buffer A to a final 
volume of 165 ml, with a protein concentration of 50 mg/ml. The S2 supernatant was then centrifuged for 10 

40 minutes and the resulting pellet ("P3") was resuspended in 65 ml buffer A. The resulting supernatant (''S3'') 
was further centrifuged for 10 minutes and the resulting P4 pellet was resuspended in buffer A to a final 
volume of 50 ml. with a protein concentration of 11.3 mg/ml. Because the P4 pellet contained so little IL-3, it 
was not used in subsequent steps. The S4 supernatant from the final centrifugation, approximately 600 ml, 
contained the membranous components at a concentration of approximately 10 mg/ml. 

45 The P2 and P3 pellets were pooled and centrifuged at 9000 rpm (GSA rotor) for 10 minutes yielding 
two pellets (''P2-2'*) and a cloudy supernatant, which using HPLC analysis was found void of IL-3 and was 
discarded. The P2-2 pellet was frozen at -20 degrees centigrade for later use. 

The frozen P2-2 pellet was then resuspended in buffer A (which contained 10 mM DTT rather than 2 
mM DTT) to a final volume of 100 ml using glass rods and magnetic stin-er and then passed through an 18 

50 gauge needle. 400 ml of 7 M fresh guanidine in the 10 mM DTT buffer A was added to the resuspended 
P2-2 and after one quick inversion, the solubilized P2-2 pellet was immediately placed in 3 x 250 centrifijge 
tubes and centrifuged for 15 minutes at 8000 rpm (GSA rotor). 500 ml of the supernatant at a concentration 
of 5.98 mg/ml was purified further at room temperature by RP-HPLC. The foregoing two steps were 
performed in 17 to 22 minutes. 

55 This purification protocol may be applied to the purification of the G-CSF and EPO CAVs expressed in 
E. coli as set forth above with similar results. 
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2. RP-HPLC separation of IL-3 CAVs 

The buffers used in this separation protocol were 0.1% (v/v) TFA in water, and 0.1% TFA in acetonitrite. 

A two inch Vydac C4 column was equilibrated in 10% acetonitrile. The supernatant fronn the 7 M 
5 guanidine solubilization was immediately applied onto the C4 column having a volume of approximately 470 
ml at 180 ml per minute. The column was developed at 20 ml per minute and was washed in 10% 
acetonitrile until absorbance at 280 nm was back to baseline. The following gradient was established by 
washing with the following concentrations of acetonitrile at the following times: 



10 



time (in minutes) 


% acetonitrile 


5 


10 


10 


35 


55 


55 


60 


80 


65 


80 


67.5 


10 



40 ml fractions were collected after 35 minutes into the gradient. 10 ul samples were removed from 
each fraction, vacuum speed dried and taken up in 20 ul of 2x SDS-sample Laemmli buffer. SDS-PAGE 
analysis was performed and IL-3 presence was contirmed. All fractions were then frozen at-80 degrees 
centigrade 

Similariy, RP-HPLC separation of G-CSF and EPO CAVs may be accomplished. 

3. Refolding of IL-3 CAVs 

One of the RP-HPLC separated fractions containing approximately 75 mg (7.5 ml) IL-3 was diluted to 
approximately 0.5 mg/ml by the addition of 142.5 ml of 6.4 M guanidine in 50 mM NaPO^ pH 7.0, 1 mM 
EDTA and 0.2 mM DTT. The mixture was then added to 750 ml of 50 mM NaPO* pH7, 1 mM EDTA, 0.2 
mM DTT buffer, transferred to dialysis tubing and dialyzed for two hours against 4 L of tiie same buffer. 
The IL-3 (now approximately 0.22 M guanidine) was twice further dialyzed against 8 L of the same buffer 
containing 0.1 mM DTT. 

PEGylation of tiiis purified IL-3 is set forth in Examples 7 and 8 below. 

Refolding of G-CSF and EPO CAVs may be accomplished in the same manner. 

4. Confirmation of bioactivity 

Bioactivity of IL-3 CAVs or EPO CAVs may be confirmed by using ttie TF-1 cell proliferation assay. The 
TF-1 cell line has been described (Kitamura et al.. Blood (1989) 73:375-380). 

In tiiat assay, cells are maintained at 37 degrees centigrade in humid air containing 5% CO2 and 
culture media used is RPMI (Gibco). 10% heat inactivated fetal calf serum, 2 mM L-glutamine, with 5 ng/ml 
of recombinant GM-CSF added. Every 3-4 days cells are adjusted to a density of 2 x 10^ cells/ml. Just 
prior to assay the cells are centrifuged 500xG, 5 minutes, washed in culture media without rGM-CSF. 
recentrifuged and resuspended at a density of 10^ cells/ml. 

IL-3 or EPO samples to be assayed are diluted between 1:500 and 1:10,000 in culture media witiiout 
rGM-CSF. 125 IL\ of the diluted sample is placed in the top row of a 96 well microtiter plate. The remaining 
wells are filled with 100 ul of culture media without rGM-CSF and tiie top row samples are serially diluted 
five fold down the microtiter plate. To each well, 100 ul of diluted cells (10^ cells) are added and the plate 
is incubated at 37 degrees centigrade, 5% C°2 for 48-72 hours. Thereafter. 0.5 uCi ^H-thymidine is added 
per well and the plate is further incubated for 4-6 hours. Cells are then harvested using an automated cell 
harvester (LKB 1295-001) and the ^H-thymidine uptake is quantitated. 

Alternatively for IL-3, a CML proliferation assay as described in PCT/US87/017024, International 
Publication Number WO88/00598, published January 28, 1988. can be used. 

Bioactivity of G-CSF CAVs may be confirmed using the 32D murine cell line, as described in Hapel, et 
al.. Blood (1984) 64:786-790, and adding 5% v/v of WEHI-3 conditioned media from a 48 hour culture of 
WEHI-3 cells. ATCC TIB68. (1x10^ cells/ml) in RPMI-1640 media supplemented witii 2 mM L-glutamine 
instead of the 5 ng/ml recombinant GM-CSF in the TF-1 cell proliferation protocol. The wash and assay 
steps are then carried out in the absence of WEHI-3 conditioned media. 
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EXAMPLE 4: Mutagenesis Protocol 

Site directed mutagenesis may be effected using conventional procedures known in the art. See e.g., 
International Applications Nos. WO 87/07144. and WO 87/04722, and US Patent Application Serial Nos. 
5 099,938 (filed September 23, 1987) and 088.188 (filed August 21 , 1987) and the references cited therein. 

EXAMPLE 5: Exemplary Mutagenesis Reactions 

The following human IL-3, G-CSF and EPO muteins were engineered by substitution of the codons 
10 indicated for a cys codon, or by insertion of a cys codon, using conventional site directed mutagenesis 
techniques: 
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IL-3 



70 



J5 



20 



25 



35 



40 



45 



mp mute in Tn3 routein cys modification 

mpCyslO m3cysl0 AAA to TGC (Lys to Cys) 

mpCyse m3Cys6 ACT to TGC (Thr to Cys) 

npCysS m3Cys8 TCT to TGC (Ser to Cys) 

inpCysl2 itt3Cysl2 TCT to TGC (Ser to Cys) 

mpCyslOO m3CyslOO AAG to TGT (Lys to Cys) 

mpCysl34 m3Cysl34 Insertion of TGT between 

TTC and TAG (Cys between 
Phe 133 and stop codon) 

mpCysS ATG to TGC (Met to Cys) 

mpAlCysl9 Replacement of amino acids 

1-15 with the "mp" 
terminus and modif. of 
pos. 19 from ATG to TGC 
(Met to Cys) 

in3Cys6,10 ACT and AAA to 

TGC (Thr and Lys 
to Cys) 

m3Cys9,10 TTA and AAA to TGC (Leu 
and Lys to Cys) 

m3Cys6,8 ACT and TCT to TGC (Thr 
and Ser to Cys) 

m3Cys6,8,10 ACT, TCT and AAA to TGC 
(Thr, Ser and Lys to Cys) 

in3Cys8,9,10 TCT, TTA and AAA to TGC 
(Ser, Leu and Lys to Cys) 

G-CSF 



mp mute in 
mpAlal7Cys37 



cys modificatiot> 

GCC to TGC (Ala to Cys) 



50 



mp mute in 
mpCys9 



EPO 



CVS modification 
TCT to TGT (Ser to Cys) 



In the examples depicted above the modification site of the natural IL-3. G-CSF or EPO protein is 
55 designated by the number after *'Cys" and the amino acid sequence of the CAV is identical to that of the 
native protein, except for the position indicated, with respect to the N-terminus (see Rg. 1). The "mp" and 
"m3" designations signify the two different alterations of the N-terminus that will be discussed in detail 
below. Additionally, cys may be introduced in place of native IL-3 codons. for example at positions 63 or 66. 
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alone or in combination with other cys Introduction(s), e.g. at position 10~with any of the described N- 
termini. Contemplated EPO muteins include EPO mpCys166. having the mp N-terminus and the native 
arginine at position 166 deleted and replaced with cysteine, and mpCys24Cy$38Cys83, having the mp N- 
terminus and the three N-linked glycosylation sites at the asparagine amino acids replaced with cysteines. 

5 With respect to IL-3 muteins. certain point modifications may result in partial loss of biological activity 
or Inability of the sulfhydryl reactive compound to attach. For example, modification at position 28 results in 
a biologically active CAV. but attachment of a sulfhydryl reactive compound fails, possibly because position 
28 appears internally in the refolded CAVs tertiary structure. Compare, Wingfield, D., et al,, Eur. J. Biochem. 
(1989) 179:565-571, in which the authors discussed the Cys modification of IL-1^ at position 138 to active 

10 IL-1 ^-phycoerythrin conjugate. Additionally, we have found that substitution of a cysteine residue for the 
amino acids at positions 15 or 51 of the natural human IL-3 may result in partial loss of bioactivity. To test 
for activity after attachment of the sulfhydryl reactive compound, this Invention further provides a "small 
scale" screening technique to readily determine whether modification and attachment has been successful 
(see Example 9 below). 

js The human IL-3 was additionally modified at its N-terminus in two different and alternative configura- 
tions, represented by the "mp" and "m3" designations. The "mp" designation indicates a deletion of the 
first alanine in the natural human IL-3 protein, thereby changing the N-terminat sequence from 
MET*ALA*PRO to MET*PRO. The "m3" designation indicates a deletion of the first two amino acids In the 
natural human IL-3 protein, METALATRO, to yield a terminus beginning METTHR^GLUTHR'. The reasons 

20 for these modifications have already been discussed. With respect to N-terminus modification of the IL-3 
mpA1Cys19 mutein, amino acids 1-15 were deleted and replaced with the "mp" terminus. The human G- 
CSF and EPO muteins were additionally modified at the N-temninus by deletion of the first amino acid to 
obtain "mp" muteins. 

It should be understood of course that the depicted list of muteins is merely exemplary and not 
25 exclusive. The design and synthesis of alternative and additional muteins in accord with this invention Is 
well within the present skill in the art. Synthesis of such muteins may be conveniently effected using 
conventional techniques and methods. 

One skilled in the art, of course, could readily design and synthesize other muteins for substitution of 
cysteine codons or insertion thereof in DNA sequences encoding IL-3, G-CSF and EPO. To modify more 
30 than one site, mutagenesis may be carried out tteratively, or in some cases using an oligonucleotide 
designed for mutagenesis at more than one site. 

EXAMPLE 6: Synthesis of DNA molecules encoding CAVs 

35 As an alternative to the production of CAV-encoding DNA by mutagenesis of the parental DNA 
sequence, it should be understood that the desired CAV-encoding DNA may be prepared synthetically. In 
that case, it will usually be desirable to synthesize the CAV DNA in the form of overlapping 
oligonucleotides, e.g. overlapping 50-80mers, which together span the desired coding sequence and contain 
the cysteine additions desired: 

40 



An exemplary EPO mutein made in accordance with this Example is EPO mpCys9. The chemically 
synthesized cDNA shown in Rgure 3 is assembled, modified at position 9, and purified using techniques 
know to those skilled in the art. See Wosnick, et al.. Gene 60 :115-127 (1987); see also U.S. Patent No. 
50 4.904,584. The synthetic cDNA is designed with "overhanging end" nucleotide sequences compatible with 
those generated by the restriction enzymes. Ndel and Xbal. The purified, synthetically derived. cDNA is 
ligated with the purified Ndel-Xbal vector portion of plasmid pAL-hiL3-781 . resulting in the replacement of 
human IL-3 cDNA with human erythropoietin cDNA. EPOmpCys166 and EPO mpCys24Cys38Cys83 can be 
made in the same manner. 

55 Given a desired coding sequence, the design, synthesis, assembly and ligation, if desired, to synthetic 
linkers of other appropriate oligonucleotides is well within the present level of skill in the art. 
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EXAMPLE 7: PEGyiation of the IL-3 mpCyslO mutein 

The mutein human IL-3 mpCyslO was prepared in accordance with Example 5 above and PEGylated 
with two PEG 5000 derivatives, S-Pyridyl Monomethoxy PEG 5000 (PEG 5000 SPDP) and Maleimldo 
5 Monomethoxy PEG 5000 (PEG 5000 SMCC). 

1 . PEGyiation with S-Pyridyl Monomethoxy PEG 5000: a reducible linkage 
a.) Preparation of the sulfhydryl reactive compound. 

70 

PEG 5000 was activated for attachment to a sulfhydryl group as follows. 2.0 grams of Monomethoxy 
PEG 5000 amine was dissolved in 12 ml dry peroxide free, dioxane. 144 mg (15% excess) of N- 
succinimidyl-3-(2 pyridyldithio) propionate (SPDP) was added as a dry powder and the reaction was allowed 
to proceed at room temperature. After 24 hours, the S-pyridyl Monomethoxy PEG 5000 product was 
75 precipitated using dry, peroxide free diethyl ether and washed with ether. The product was dried under 
vacuum to obtain 1 .92 grams of white solid, which was identified as S-Pyridyl Monomethoxy PEG 5000 by 
NMR and IR. The PEG 10.000 analog (PEG 10,000 SPDP) was likewise prepared via an analogous 
procedure. 

20 b.) PEGyiation of mutein CIO human IL-3. 

For this coupling, natural (wild type) human IL-3 was also treated with the PEGyiation reagents as a 
negative control. A stock solution at 1 mg/ml of the mpCyslO mutein in a pH 7 buffered solution of 50 mM 
NaHpPOf . 100 micro M DTT. 1 mM EDTA and about 3mM Guanidine HCl was used. DTT was added to 

25 prevent dimerizatlon of the protein: EDTA was added to prevent dimerization via metal mediated oxidative 
coupling. Guanidine remains as an artifact of the refolding of the protein. A pH 7 was used; a range of 6.5- 
7.5 is preferred. 0.9 mg of S-Pyridyl Monomethoxy PEG 5000, prepared as set forth above, was weighed 
into an Eppendorf tube. 360 microliters of the buffered mutein was added and the mixture was vortexed 
briefly to homogeneity. The reaction was performed at 4 degrees centigrade and when sampled after 2 

30 hours, was found to be complete. Analysis on a 10-20% gradient SDS acrylamide gel stained with 
Coomassie blue showed the product as nearly pure and running at about 28 kD. (By comparison, mpCyslO 
and its dimer were used as standards and found to migrate to 15 and 30 kD respectively.) A reducing lane 
on the gel showed that the PEGylated IL-3 mutein is sensitive to reduction by DTT and regenerated the 
original protein at about 15 kD. 

35 

2. PEGyiation with Maleimido Monomethoxy PEG 5000: a non-reducible linkage 

a. ) Preparation of the sulfhydryl reactive compound. 

40 In this experiment, PEG 5000 activation was accomplished as follows. 2.0 mg of monomethoxy PEG 
5000 amine was dissolved in 12 ml of dry. peroxide free dioxane. 154 mg of sulfosuccinimidyl 4-(N- 
maleimidomethyl) cyclohexane-1-carboxylate (SMCC) (15% excess) was added as a dry powder and the 
reaction was allowed to proceed at room temperature. After 24 hours, work up of the product was carried 
out in the same manner as the S-Pyridyl Monomethoxy PEG 5000 to obtain 1.82 grams Maleimido 

45 Monomethoxy PEG 5000. The PEG 10,000 analog was prepared similarly. 

b. ) PEGyiation of the cyslO IL-3 mutein. 

This reaction was carried out in the same manner as the PEGyiation reaction using the reducible PEG 
50 5000 reagent and with natural human lL-3 as a negative control. However, 1.0 mg of the PEG derived 
PEGylating agent Maleimido Monomethoxy PEG 5000 was used. 400 microliters of the mpCyslO IL-3 
mutein was added and vortexed to homogeneity. At t=2 hours the reaction was found to be complete. The 
product was nearly pure and indistinguishable from the S Pyridyi derived conjugate in molecular weight. 
However, this product is perfectly inert to reductive conditions, such as DTT; in this reducing lane the 
55 product, at 28 kD. persists. 

In both control reactions, nothing indicative of conjugation is evident at 2 hrs or even at 24 hrs. 
Selectivity for accessible sulfhydryls in this chemistry is therefore very high. 
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EXAMPLE 8: PEGylation of multiple cysteine muteins m3Cys9.10 and m3Cys6.10 



In this experiment, protein stock for both muteins was at 300 ug/ml in the phosphate buffer solution, as 
described In Example 7. PEGylation stock solutions consisted of the S-pyridyl or Maleimlde activated PEG 
5 5000 polymers at 50 ug/ml in the same buffer. To initiate the reaction. 11 ul of the appropriate PEG stock 
was added to 100 ul of the appropriate protein stock (either the m3Cys9,10 mutein or the m3Cys6,10 
mutein) while vortexing. Reactions were allowed to proceed at 4 degrees centigrade overnight. SDS gel 
analysis of the products as described above revealed that only a trace of starting material remained with 
both chemistries. Furthennnore, tx)th chemistries resulted in new products with a gel mobility of about 37 kD. 
10 Reducing lanes on this same gel show that the maleimide conjugate is resistant to reducing, while the S- 
Pyridyl derived conjugate reverts to starting material. 

EXAMPLE 9: PEGylation of the G-CSF mpAla17Cys37 mutein 

The mutein human G-CSF mpAla17Cys37 was prepared in accordance with Example 5 above and 
PEGylated with PEG 5000 SPDP, PEG 5000 SMCC and PEG 10,000 SPDP. The natural cysteine at position 
17 was deleted and replaced with alanine to prevent possible improper disulfide bridge formation. The 
mpAla17Cys37 G-CSF protein stock was dialyzed Into a pH 7.0 buffering solution of 50mM NaH2P0+ and 
ImM EDTA, lOOuM DTT and concentrated to approximately lOOug/ml. Total volume was 670 ul. 

Stocks of the PEGylation reagents (PEG 5000 SPDP, PEG 5000 SMCC, and PEG 10.000 SPDP) were 
made up fresh at lOmM in HzO (18 x stock solutions prepared as described in Example 7, parts la and 
1 b). The reactions were carried out using the reagents and amounts set forth below. 



Rxn# 


PEG Reagent 


Stock 


ul Protein 


Ul PEG 


1 


5000 SPDP 


170 


10 


2 


5000 SMCC 


170 


10 


3 


10.000 SPDP 


170 


10 



All reactions were ImM in PEG reagent. As in Example 7, the mutein was added to the PEG reagent 
and vortexed to homogeneity. The reactions were allowed to proceed at 4^C overnight and found to be 
complete. 

Analysis on a 10-20% gradient SDS acrytamide gel with both reducing and non-reducing lanes and 
stained with Coomassie blue showed the mpAla17Cys37 PEG 5000 SPDP and mpAla17Cys37 PEG 5000 
SMCC products as nearly pure and running at about 28 kD. 

Non-PEGylated G-CSF mute in runs at approximately 19kD. A reducing lane on the gel showed that the 
SPDP conjugate is sensitive to reduction in the presence of DTT. The SMCC derived reagent was totally 
resistant to reductive treatment. The PEG 10.000 SPDP PEGylated mpAla17Cys37 G-CSF runs at 
approximately 32 kD. The non-PEGylated G-CSF mutein is regenerated by DTT treatment. 

EXAMPLE 10: PEGylation of the EPO mpCys9 mutein 

The mutein human EPO mpCys9 prepared in accordance with Example 5 above can be PEGylated 
using the stock PEGylation reagents prepared in Example 9 and the natural human EPO as a negative 
control. To initiate the reaction, 10 ul of the appropriate PEG stock as prepared in Example 9 is added to 
170 ul of the EPO mpCys9 stock at a concentration of lOOug/ml. The mixture is vortexed and the reaction 
is allowed to proceed at 4°C overnight Upon completion, the reaction can be analyzed as in Example 9. 

EXAMPLE 11: Screening of novel CAVs 

Having the constructed novel DNA molecules encoding CAVs in the appropriate expression vector and 
having attached the sulfhydryl reactive compound to the muteins, it may be desirable to produce each CAV 
55 protein on a small scale and "screen" for muteins which possess the desired attachment site or sites. The 
biological activity of each CAV. before and after attachment of the sulfhydryl reactive compound can be 
rapidly assessed using an in vitro assay. 
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Small scale bacterial production of IL-3 CAV muteins 

Bacterial strain Gi586 was transformed with purified plasmid DNAs consisting of bacterial expression 
vector, pAL-hlL3-781. ATCC Accession Number 67932, with novel CAV IL-3 coding sequences. The 

s transformed cells were spread on LB agar plates containing 50 ug/ml ampicillin at a density to yield 
approximately 100 colonies per plate. 3 ml of L broth plus 50 ug/ml ampicillin was inoculated with a single 
bacterial colony and grown overnight at 30 degrees centigrade. 50 ml of induction media (0.1 x L Broth, 
1xM9 salts. 0.4% glucose, ImM MgS04, 50 ug/ml ampicillin) was inoculated with 1 ml of the overnight 
culture. The 50 ml culture was grown with aeration at 30 degrees centigrade until an 0.5 OD 600 nm level 

10 was reached, then the temperature was shifted to 40 degrees centigrade and growth continued for at least 2 
hours. 

Ceils were then harvested by centrifugation at 3500 rpm for 5 minutes in a Sorval centrifuge with a 38 
rotor. The supernatant was discarded and the cell pellet resuspended in 1 ml of buffer PED (50 mM 
NaH2P0*. pH 7.0. 1 mM EDTA, 5mM DTT. 10 mM PMSF). This 1 ml solution was passed twice through a 

75 French Press at 10,000 psi and kept on ice. The solution was then microfuged for 5 minutes at 12,000 rpm. 
The supernatant was discarded and the pelleted material was resuspended in 150 ul of 7 M guanidine-HCI 
in PED buffer. The solution was then diluted with 650 ul of PED buffer and placed in dialysis tubing (10.000 
MWCO Spectrapore). The sample was dialyzed for at least 4 hours against 2 liters of PED.1 buffer (50 mM 
NaH2P04 pH 7.0. 1 mM EDTA, 0.1 mM DTT). The sample was collected and microfuged to remove 

20 precipitated proteins. The sample was then analyzed on a 12% Laemmli SDS PAGE gel and the amount of 
IL-3 protein estimated. 

The protein solution was then concentrated to about 0.5 mg/ml, and 200 ug was reacted with a 15 fold 
molar excess of either S-Pyridyl Monomethoxy PEG 5000 or Maleimido Monomethoxy PEG 5000 for 
several hours at 4 degrees centigrade. The products were then analyzed by SDS PAGE and biological 
25 activity detemiined by an in vitro TF-1 cell proliferation assay. 

This small scale production methodology may be similarly advantageously applied to production of the 
novel CAV G-CSFs and EPOs of the present invention. 

Alternatively, this small scale production for screening may be carried out before attachment of the 
sutfhydryl reactive compound. In that case, biological activity may still be determined by an in vitro TF-1 
30 cell proliferation assay (or in the case of G-CSF, a 32D cell proliferation assay) and the products may be 
analyzed by SDS PAGE analysis, in accordance with known techniques. 

The same or similar procedures may be used by one skilled in the art to attach other sulfhydryl reactive 
compounds to the other CAVs of the invention. Homogeneity can be observed by conventional analysis of 
the modified CAVs so produced e.g. using standard SDS-PAGE or HPLC analysis. 
35 Numerous modifications may be made by one skilled in the art to the methods and compositions of the 
present invention in view of the disclosure herein. Such modifications are believed to be encompassed by 
this invention as defined by the appended claims. 

Claims 

40 

1. A cysteine added variant ("CAV") of EPO. characterized in that said CAV comprises a peptide 
sequence of human EPO modified to contain at least one non-native cysteine residue at which residue 
said CAV is covalently attached to at least one sulfhydryl reactive compound. 

45 2. A CAV of claim 1, characterized in that the N-tenninus of the CAV commences with methionine and the 
native amino acid at position 1 of the mature peptide sequence is deleted. 

3. A CAV of any of claims 1 to 2, characterized in that said at least one sulfhydryl group is selected from 
one or more of the group consisting of dextran. a carbohydrate based polymer, such as a colominic 

50 acid, a polymer of an amino acid, biotin, and a polyalkylene glycol moiety. 

4. A CAV of any of claims 1 to 2, characterized in that said at least one sulfhydryl reactive compound is 
polyethylene glycol. 

55 5. A DNA sequence encoding a CAV comprising a peptide sequence of human EPO modified to contain 
at least one non-native cysteine residue for attachment of at least one sulfhydryl reactive compound. 

6. A host cell containing and capable of expressing a DNA sequence of claim 5. 
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7. A method of producing a CAV of any of claims 1 to 4, comprising covalently attaching at least one 
sulfhydryl reactive compound to at least one non-native cysteine of said CAV produced by culturing a 
host cell containing and capable of expressing a DNA sequence encoding said CAV. 

5 8. Use of a CAV of any of claims 1 to 4 for the preparation of pharmaceutical compositions suitable for 
stimulating hematopoiesis. 
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FIGURE 1 



1 10 
ATG OCT CCT ATG ACT CAA ACT ACT TCT TTA AAA ACT TCT 
Met Ala Pro Met Thr Gin Thr Thr Ser Leu Lys Thr Ser 

25 

TGG GTA AAC TGT TCT AAC ATG ATC GAT GAA ATT ATA ACA 
Trp Val Asn Cys Ser Asn Met lie Asp Glu lie lie Thr 



CAC TTA AAG CAG CCA CCT TTG CCC TTG CTG GAC TTC AAC 
His Leu Lys Gin Pro Pro Leu Pro Leu Leu Asp Phe Asn 

40 

AAC CTC AAT GGG GAA GAC CAA GAC ATT CTG ATG GAA AAT 
Asn Leu Asn Gly Glu Asp Gin Asp He Leu Met Glu Asn 

55 

AAC CTT CGA AGG CCA AAC CTG GAG GCA TTC AAC AGG GCT 
Asn Leu Arg Arg Pro Asn Leu Glu Ala Phe Asn Arg Ala 

70 

GTC AAG AGT CTG CAA AAT GCA TCA GCA ATT GAG AGC ATT 
Val Lys Ser Leu Gin Asn Ala Ser Ala He Glu Ser He 

85 

CTG AAA AAT CTG CTG CCA TGT CTG CCC CTG GCC ACA GCT 
He Lys Asn Leu Leu Pro Cys Leu Pro Leu Ala Thr Ala 

100 

GCA CCC ACC AGG CAT CCA ATC CAT ATC AAG GAT GGT GAC 
Ala Pro Thr Arg His Pro He His He Lys Asp Gly Asp 



115 



TGG AAT GAA TTC CGC CGC AAA CTG ACC TTC TAT CTG AAA 
Trp Asn Glu Phe Arg Arg Lys Leu Thr Phe Tyr Leu Lys 



ACC CTG GAG AAT GCT CAG GCT CAG CAG ACC ACC CTG AGC 
Thr Leu Glu Asn Ala Gin Ala Gin Gin Thr Thr Leu Ser 

130 

CTC GCG ATC TTC TAG 
Leu Ala He Phe Stop 
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FIGURE 2a 

ATG 
MET 

1 10 
ACC CCC CTG GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe 

20 

CTG CTC AAG TGC TTA GAG CAA GTG AGG AAG ATC CAG GGC 
Leu Leu Lys Cys Leu Glu Gin Val Arg Lys lie Gin Gly 

30 

GAT GGC GCA GCG CTC CAG GAG AAG CTG TGT GCC ACC TAC 
Asp Gly Ala Ala Leu Gin Glu Lys Leu Cys Ala Thr Tyr 

40 50 
AAG CTG TGC CAC CCC GAG GAG CTG GTG CTG CTC GGA CAC 
Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu Gly His 

60 

TCT CTG GGC ATC CCC TGG GCT CCC CTG AGC AGC TGC CCC 
Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys Pro 

70 

AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG AGC CAA CTC 
Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu 

80 90 
CAT AGC GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG GCC 
His Ser Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala 

100 

CTG GAA GGG ATC TCC CCC GAG TTG GGT CCC ACC TTG GAC 
Leu Glu Gly lie Ser Pro Glu Leu Gly Pro Thr Leu Asp 

110 

ACA CTG CAG CTG GAC GTC GCC GAC TTT GCC ACC ACC ATC 
Thr Leu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He 

120 130 
TGG CAG CAG ATG GAA GAA CTG GGA ATG GCC CCT GCC CTG 
Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu 

140 

CAG CCC ACC CAG GGT GCC ATG CCG GCC TTC GCC TCT GCT 
Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala 

150 

TTC CAG CGC CGG GCA GGA GGG GTC CTG GTT GCC TCC CAT 
Phe Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His 
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FIGURE 2b 



160 

CTG GAG AGO TTC CTG GAG GTG TOG TAG CGC GTT OTA CGC 
Leu Gin Ser Phe Leu Glu Val Ser Tyr Arg Val Leu Arg 

170 174 
CAC CTT GCC CAG CCC T 
His Leu Ala Gin Pro 
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FIGURE 3a 

1 10 
Met Ala Pro Pro Arg Leu lie Cys Asp Ser Arg Val 
5«ATATG GCA CCA CCA AGA TTA ATT TGT GAT TCT AGA GTA 
TAG CGT GOT GGT TCT AAT TAA ACA CTA AGA TCT CAT 



Leu Glu Arg Tyr Leu Leu 
TTA GAA CGG TAC CTC TTG 
AAT CTT GCC ATG GAG AAC 



40 

Asn lie Thr Val Pro Asp 
AAT ATC ACT GTC CCA GAC 
TTA TAG TGA CAG GGT CTG 



Trp Gin Gly Leu Ala Leu 
TGG CAG GGA TTA GCG CTA 
ACC GTC CCT AAT CGC GAT 



Pro Leu Gin Leu His Val 
CCC CTG CAG CTG CAT GTG 
GGG GAT GTC GAC GTA CAC 



Arg Ser Leu Thr Thr Leu 
CGC AGC CTC ACC ACT CTG 
GCG TCG GAG TGG TGA GAC 



Glu Ala Lys Glu Ala Glu Asn 
GAG GCC AAG GAG GCC GAG AAT 
CTC CGG TTC CTC CGG CTC TTA 



Cys Ser Leu Asn Glu 
TGC AGC TTG AAT GAG 
ACG TCG AAC TTA CTC 



Thr Lys Val Asn Phe Tyr Ala 
ACC AAA GTT AAC TTT TAC GCG 
TGG TTT CAA TTG AAA ATG CGC 



Gin Ala Val Glu Val 
CAG GCT GTA GAA GTA 
GTC CGA CAT CTT CAT 

70 

Leu Ser Glu Ala Val Leu Arg 
TTA AGT GAA GCT GTT CTC CGC 
AAT TCA CTT CGA CAA GAG GCG 

85 

Ser Gin Pro Trp Glu 
TCC CAG CCG TGG GAG 
AGG GTC GGC ACC CTC 

100 

Asp Lys Ala Val Ser Gly Leu 
GAT AAA GCC GTC AGT GGC CTT 
CTA TTT CGG CAG TCA CCG GAA 

115 

Leu Arg Ala Leu Gly Ala Gin 
CTT CGG GCT CTG GGA GCC CAG 
GAA GCC CGA GAC CCT CGG GTC 



25 
He Thr 
ATC ACG 
TAG TGC 



Thr Gly 
ACG GGC 
TGC CCG 



Cys Ala 
TGT GCT 
ACA CGA 



Glu His 
GAA CAC 
CTT GTG 



Trp Lys 
TGG AAA 
ACC TTT 



Arg Met 
AGA ATG 
TCT TAC 



55 
Glu Val 
GAA GTT 
CTT CAA 



Gly Gin 
GGC CAG 
CCG GTC 



Gly Gin Ala Leu Leu Val Asn Ser 
GGT CAG GCT TTA TTA GTC AAC TCT 
CCA GTC CGA AAT AAT CAG TTG AGA 
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FIGURE 3b 



Lys 


Glu 


Ala 


He 


Ser 


Pro 


Pro 


Asp 


Ala 


Ala 


Ser 


Ala 


Ala 


AAG 


GAA 


GCC 


ATC 


TCC 


CCT 


CCA 


GAT 


GCG 


GCC 


TCA 


GCT 


GCT 


TTC 


CTT 


CGG 


TAG 


AGG 


GGA 


GGT 


CTA 


CGC 
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Tyr 
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Lys 
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TTC 


CGA 
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TAG 


TCC 
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160 

Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 
CTG TAC ACA GGG GAG GCC TGC AGG ACA GGG GAC AGA 
GAC ATG TGT CCC CTC CGG ACG TCC TGT CCC CTG TCT 

STOP 

TAA TAATGATAGGATCCT 

ATT ATTACTATCCTAGGAGATC - 5» 
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